
Dynamic Incentive-Aware Learning: Robust Pricing in Contextual Auctions

Neural Information Processing Systems

Motivated by pricing in ad exchange markets, we consider the problem of robust learning of reserve prices against strategic buyers in repeated contextual second-price auctions. Buyers' valuations for an item depend on the context that describes the item. However, the seller is not aware of the relationship between the context and buyers' valuations, i.e., buyers' preferences. The seller's goal is to design a learning policy to set reserve prices via observing the past sales data, and her objective is to minimize her regret for revenue, where the regret is computed against a clairvoyant policy that knows buyers' heterogeneous preferences. Given the seller's goal, utility-maximizing buyers have the incentive to bid untruthfully in order to manipulate the seller's learning policy.


Reviews: Dynamic Incentive-Aware Learning: Robust Pricing in Contextual Auctions

Neural Information Processing Systems

The authors study the problem of setting (individual) reserve prices in a scenario of repeated contextual second-price auctions. The buyers are assumed to be strategic, i.e., each optimizes a cumulative discounted utility, and their valuations are linear functions of the feature vector of a good. The considered scenario explicitly assumes the existence of market noise. The seller's goal is to find an algorithm for setting prices that has sub-linear regret. Two algorithms are proposed: the first attains an O(d log(Td) log(T)) regret bound when the market noise distribution is known to the seller.
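To make the setup concrete, here is a minimal sketch (not the authors' code; `theta`, `valuation`, and the noise scale are illustrative assumptions) of the valuation model described above: a buyer's value for a good is linear in its feature vector, perturbed by market noise.

```python
import numpy as np

rng = np.random.default_rng(0)

d = 4                            # dimension of the contextual information
theta = rng.uniform(0, 1, d)     # buyer's preference vector (unknown to the seller)
theta /= np.linalg.norm(theta, 1)  # normalize for illustration

def valuation(x, noise_scale=0.1):
    """Linear valuation <theta, x> plus market noise z (assumed Gaussian here)."""
    z = rng.normal(0.0, noise_scale)
    return float(theta @ x + z)

x_t = rng.uniform(0, 1, d)       # context (feature vector) of the good at round t
v_t = valuation(x_t)             # buyer's realized valuation at round t
```

The seller never observes `theta` directly; she must infer it from sales outcomes while buyers may bid below `v_t` to influence her future reserve prices.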



Dynamic Incentive-aware Learning: Robust Pricing in Contextual Auctions

Golrezaei, Negin, Javanmard, Adel, Mirrokni, Vahab

arXiv.org Machine Learning

Motivated by pricing in ad exchange markets, we consider the problem of robust learning of reserve prices against strategic buyers in repeated contextual second-price auctions. Buyers' valuations for an item depend on the context that describes the item. However, the seller is not aware of the relationship between the context and buyers' valuations, i.e., buyers' preferences. The seller's goal is to design a learning policy to set reserve prices via observing the past sales data, and her objective is to minimize her regret for revenue, where the regret is computed against a clairvoyant policy that knows buyers' heterogeneous preferences. Given the seller's goal, utility-maximizing buyers have the incentive to bid untruthfully in order to manipulate the seller's learning policy. We propose learning policies that are robust to such strategic behavior. These policies use the outcomes of the auctions, rather than the submitted bids, to estimate the preferences while controlling the long-term effect of the outcome of each auction on the future reserve prices. When the market noise distribution is known to the seller, we propose a policy called Contextual Robust Pricing (CORP) that achieves a T-period regret of $O(d\log(Td) \log (T))$, where $d$ is the dimension of the contextual information. When the market noise distribution is unknown to the seller, we propose two policies whose regrets are sublinear in $T$.
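For readers unfamiliar with the auction format, the following sketch illustrates the assumed mechanics of a second-price auction with personalized reserve prices (this is a standard textbook formulation, not the authors' CORP policy): the highest bidder who clears her reserve wins and pays the maximum of the highest competing eligible bid and her own reserve.

```python
def run_auction(bids, reserves):
    """Second-price auction with per-buyer reserves (illustrative sketch).

    Returns (winner_index, payment); winner is None when no bid clears
    its reserve and the item goes unsold.
    """
    # Only bids at or above their buyer's reserve are eligible to win.
    eligible = [i for i, b in enumerate(bids) if b >= reserves[i]]
    if not eligible:
        return None, 0.0
    winner = max(eligible, key=lambda i: bids[i])
    # Price: the highest competing eligible bid, floored by the winner's reserve.
    others = [bids[i] for i in eligible if i != winner]
    second = max(others) if others else 0.0
    payment = max(second, reserves[winner])
    return winner, payment

winner, revenue = run_auction(bids=[0.9, 0.6, 0.4], reserves=[0.5, 0.5, 0.5])
# winner 0 pays max(0.6, 0.5) = 0.6
```

Because the payment depends on reserves the seller sets from past data, a forward-looking buyer may shade her bid today to push tomorrow's reserve down, which is exactly the strategic manipulation the proposed policies are designed to resist.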